Finding Public Opinion Manipulation Trolls in Bulgarian Online News Media

نویسندگان

  • Todor Borisov Mihaylov
  • Ivan Koychev
  • Todor Mihaylov
چکیده

With the rise of social media, it became normal for people to read and follow other users' opinion. This created the opportunity for corporations, governments and others to distribute rumors, misinformation, and speculation and to use other dishonest practices to manipulate public opinion (Derczynski and Bontcheva , 2014). They could consistently use trolls (Cambria, Chandra and Sharma , 2010), write fake posts and comments in public forums, thus making veracity one of the challenges in digital social networking (Derczynski and Bontcheva , 2014). During the recent popular protest in Bulgaria in 2013 1 , social networks and news community forums became the main " battle grounds " between supporters and opponents of the government. In that period, there was notable censorship in the media, and many people who lived outside the capital did not really know what was actually happening. Moreover, there was a very notable presence of government supporters in Web forums. In series of leaked documents in the independent Bulgarian media Bivol, it became clear that the ruling party was using European parliament money to pay for hiring Internet trolls 2 3. In our work, we aim to find troll users and troll users' comments using several machine learning and natural language processing techniques. We first collect data about user profiles, comments and publications from the largest online media forum-that of Dnevnik.bg, where there is a notable presence of troll users. We build a Web crawler and several data extraction helpers for retrieving Web content and extracting structured data from the HTML content. We save the retrieved data in a relational database that reflects the natural data structure presented in the online media forum. We then use several database queries to retrieve statistical information to build user profiles and to retrieve users' comments as text and metadata. Using information and assumptions about troll vs. non-troll user behavior, we extract both statistics and text features and we build two classifiers using several configurations: one trained for finding troll vs. non-troll users and one for finding troll vs. non-troll comments. Our classifiers perform robust with accuracy of 90-96% for the user profiles' classifier and accuracy of 80-84% for the troll's comments classification. As a result of this work we have two independent papers accepted to CoNLL 2015 and RANLP 2015.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Finding Opinion Manipulation Trolls in News Community Forums

The emergence of user forums in electronic news media has given rise to the proliferation of opinion manipulation trolls. Finding such trolls automatically is a hard task, as there is no easy way to recognize or even to define what they are; this also makes it hard to get training and testing data. We solve this issue pragmatically: we assume that a user who is called a troll by several people ...

متن کامل

Disinformation Warfare: Understanding State-Sponsored Trolls on Twitter and Their Influence on the Web

Over the past couple of years, anecdotal evidence has emerged linking coordinated campaigns by state-sponsored actors with efforts to manipulate public opinion on the Web, often around major political events, through dedicated accounts, or “trolls.” Although they are often involved in spreading disinformation on social media, there is little understanding of how these trolls operate, what type ...

متن کامل

Analyzing the Digital Traces of Political Manipulation: The 2016 Russian Interference Twitter Campaign

Until recently, social media was seen to promote democratic discourse on social and political issues. However, this powerful communication platform has come under scrutiny for allowing hostile actors to exploit online discussions in an attempt to manipulate public opinion. A case in point is the ongoing U.S. Congress investigation of Russian interference in the 2016 U.S. election campaign, with...

متن کامل

Hunting for Troll Comments in News Community Forums

There are different definitions of what a troll is. Certainly, a troll can be somebody who teases people to make them angry, or somebody who offends people, or somebody who wants to dominate any single discussion, or somebody who tries to manipulate people’s opinion (sometimes for money), etc. The last definition is the one that dominates the public discourse in Bulgaria and Eastern Europe, and...

متن کامل

Exposing Paid Opinion Manipulation Trolls

Recently, Web forums have been invaded by opinion manipulation trolls. Some trolls try to influence the other users driven by their own convictions, while in other cases they can be organized and paid, e.g., by a political party or a PR agency that gives them specific instructions what to write. Finding paid trolls automatically using machine learning is a hard task, as there is no enough train...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2016